Goto

Collaborating Authors

 condensed matter physic


CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

arXiv.org Artificial Intelligence

We introduce CMPhysBench, designed to assess the proficiency of Large Language Models (LLMs) in Condensed Matter Physics, as a novel Benchmark. CMPhysBench is composed of more than 520 graduate-level meticulously curated questions covering both representative subfields and foundational theoretical frameworks of condensed matter physics, such as magnetism, superconductivity, strongly correlated systems, etc. To ensure a deep understanding of the problem-solving process,we focus exclusively on calculation problems, requiring LLMs to independently generate comprehensive solutions. Meanwhile, leveraging tree-based representations of expressions, we introduce the Scalable Expression Edit Distance (SEED) score, which provides fine-grained (non-binary) partial credit and yields a more accurate assessment of similarity between prediction and ground-truth. Our results show that even the best models, Grok-4, reach only 36 average SEED score and 28% accuracy on CMPhysBench, underscoring a significant capability gap, especially for this practical and frontier domain relative to traditional physics. The code anddataset are publicly available at https://github.com/CMPhysBench/CMPhysBench.


AI and Physics: Hand-in-Hand Advancements

#artificialintelligence

Science and technology often facilitate one another; the latest discoveries in one will lead to new discoveries in the other. Along with innovations in engineering, medicine, and many other fields, this co-evolution can also be seen in physics. The continuing improvements in technology, in particular artificial intelligence (AI) and machine learning (ML), open doors for physics researchers to explore more precise and in-depth topics -- leading to new discoveries and a deeper understanding of our world. With roots in statistical mechanics, the mathematical foundation of AI development is shared with many branches of physics, making the two natural counterparts. Since "physics" is an extremely broad subject area and covers many different fields, each field may utilize AI differently.


Intelligent Machines are Teaching Themselves Quantum Physics - Motherboard

#artificialintelligence

Last year, Google's DeepMind AI beat Lee Sedol at Go, a strategy game like chess, but orders of magnitude more complicated. The win was a remarkable step forward for the field of artificial intelligence, but it got Roger Melko, a physicist at the Perimeter Institute for Theoretical Physics, thinking about how neural networks--a type of AI modeled after the human brain--might be used to solve some of the toughest problems in quantum physics. Indeed, intelligent machines may be necessary to solve these problems. "The thing about quantum physics is it's highly complex in a very precise mathematical sense. A big problem we face when we study these quantum systems [without machine learning] is how to deal with this complexity," Melko told me.


Researchers apply machine learning to condensed matter physics

#artificialintelligence

A machine learning algorithm designed to teach computers how to recognize photos, speech patterns, and hand-written digits has now been applied to a vastly different set of data: identifying phase transitions between states of matter. This new research, published today in Nature Physics by two Perimeter Institute researchers, was built on a simple question: could industry-standard machine learning algorithms help fuel physics research? To find out, former Perimeter Institute postdoctoral fellow Juan Cassasquilla and Roger Melko, an Associate Faculty member at Perimeter and Associate Professor at the University of Waterloo, repurposed Google's TensorFlow, an open-source software library for machine learning, and applied it to a physical system. Melko says they didn't know what to expect. "I thought it was a long shot," he admits. Using gigabytes of data representing different state configurations created using simulation software on supercomputers, Carrasquilla and Melko created a large collection of "images" to introduce into the machine learning algorithm (also known as a neural network).